Learning from Ontological Annotation: an Application of Formal Concept Analysis to Feature Construction in the Gene Ontology

نویسندگان

  • Elma Akand
  • Michael Bain
  • Mark Temple
چکیده

A key role for ontologies in bioinformatics is their use as a standardised, structured terminology, particularly to annotate the genes in a genome with functional and other properties. Since the output of many genome-scale experiments results in gene sets it is natural to ask if they share common function. A standard approach is to apply a statistical test for overrepresentation of ontological annotation, often within the Gene Ontology. In this paper we propose an alternative to the standard approach that avoids problems in over-representation analysis due to statistical dependencies between ontology categories. We use a feature construction approach to pre-process Gene Ontology annotation of gene sets and incorporate these features as input to a standard supervised machine learning algorithm. Our approach is shown to allow the straightforward use of an ontology in the context of data sourced from multiple experiments to learn a classifier predicting gene function as part of cellular response to an environmental stress.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

طراحی سامانه هوشمند ساخت هستان نگار به کمک شبکه عصبی ARTو روشC-value

In recent years, many efforts have been done to design ontology learning methods and automate ontology construction process. The ontology construction process is a time-consuming and costly procedure for almost all domains/applications, so automating this process is a solution to overcome the knowledge acquisition bottleneck in information systems and reduce the construction cost. In this artic...

متن کامل

Ontological Interoperability of Learning Objects: A Hybrid Graphical-Neural Approach

This paper presents OntoShare, an automated ontology mapping and merging architecture for learning object retrieval and reuse. The architecture aims to offer contextual and robust ontology mapping and merging through hybrid unsupervised clustering techniques comprising of Formal Concept Analysis (FCA), Self-Organizing Map (SOM) and K-Means clustering incorporated with linguistic processing usin...

متن کامل

Calculating Concept Similarity Heuristics For Ontology Learning from Text

We present experimental results of an approach to learning ontological concepts from text. The ontologicalsemantic analyzer OntoSem and its knowledge resources – in particular, its NLP-oriented ontology and semantic lexicon – are used to dynamically create the feature values on which our learning approach is based. We expand upon our previously reported work, with emphasis given to development ...

متن کامل

Ontology Population via NLP Techniques in Risk Management

One of the challenging tasks in the context of Ontological Engineering is to automatically or semi-automatically support the process of Ontology Learning and Ontology Population from semi-structured documents (texts). In this paper we describe a Semi-Automatic Ontology Instantiation method from natural language text, in the domain of Risk Management. This method is composed from three steps 1) ...

متن کامل

An Ontological Critique of Critical Geopolitics in Gearóid Ó Tuathail and Simon Dalby

Introduction: Philosophical research in the study of political geography in Iran is very limited. Among the existing cases, we encounter a kind of theoretical confusion that results from a lack of sufficient scientific and philosophical accuracy. Therefore, this study seeks to address the critical philosophical geopolitical critique in the writings of Ó Tuathail and Dalby by pointing out the ne...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007